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Abstract 

Complex networks have attracted increasing interests in almost all disciplines of natural and 
social sciences. However, few efforts have been afforded in the field of chemical engineering. We 
present in this work an example of complex technological network, investigating the process flow of 
an ammonia plant (AP). We show that the AP network is a small-world network with scale-free 
distribution of degrees. Adopting Newman's maximum modularity algorithm for the detection of 
communities in complex networks, evident modular structures are identified in the AP network, 
which stem from the modular sections in chemical plants. In addition, we find that the resultant 
AP tree exhibits excellent allometric scaling. 
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Introduction 

Complex systems are ubiquitous in natural and social sciences. The behavior of complex system as a 
whole is usually richer than the sum of its parts and it is lost if one looks at the constituents separately. 
Complex systems evolve in a self-adaptive manner and self-organize to form emergent behaviors due 
to the interactions among the constituents of a complex system at the microscopic level. The study of 
complexity has been witnessed in almost all disciplines of social and natural sciences (see, for instance, 
the special issue of Nature on this topic in 2001 ? ). However, engineers seem a little bit indifferent as if 
engineering is at the edge of the science of complexity. Ottino argues that "engineering should be at 
the centre of these developments, and contribute to the development of new theory and tools" 7 and 
chemical engineering is facing new opportunities. 7 

The topological aspects of complex systems can be modelled by complex networks, where the 
constituents are viewed as vertices or nodes and an edge is drawn between two vertices if their as- 
sociated constituents interact in certain manners. In recent years, complex networks have attracted 
extensive interests, covering biological systems, social systems, information systems, and technological 
systems. 7,7,7 Complex networks possess many interesting properties. Most complex networks exhibit 
small-world traits 7 and are scale free where the distributions of degrees have power-law tails. 7 In 
addition, many real networks have modular structures or communities. 7 The fourth intriguing feature 
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of some real networks reported recently is the self-similarity. 7 The studies of complex networks have 
extensively broadened and deepened our understanding of complex systems. 

In the field of chemical engineering, chemical reactions and transports of mass, energy and momen- 
tum have been the traditional domains for about five decades, where the topological properties are of 
less concerns. Amaral and Ottino have considered two examples for which the way constituents of the 
system are linked determines transport and the dynamics of the system, that is, food webs and cellu- 
lar networks. 7 In this paper, we present an example of complex technological network in traditional 
chemical engineering, studying the topological properties of the process flow of an ammonia plant. 

The network studied here is abstracted from the process flow diagram of the Ammonia Plant of 
Jiujiang Chemical Fertilizer Plant (Jiangxi Province, China). The scale of the plant is 1000MT/D. 
The process flow diagram with the major equipment is shown in Fig. 1. In the construction of 
the Ammonia Plant network (AP network), towers, reactors, pumps, heat exchangers, and connection 
points of convergence and bifurcation of pipes are regarded as vertices. Only the equipments and pipes 
carrying raw materials, by-products, and products are considered in the construction of network. The 
utility flows are not included in the network. The pipes connecting the vertices are treated as edges. 
The AP network constructed has 505 vertices and 759 edges. 



AP network exhibits small-world effect 

The average minimum path length is among the most studied quantity in complex networks. 7,7 ' 7 
When regarding the AP network as an undirected network, we compute the average minimum path 
length (I) = 7.76 with a standard deviation a\ = 2.65. We find that the distribution of I is Gaussian. 
The skewness is 0.17 and the kurtosis excess is 0.01, which is close to the theoretical value of a 
Gaussian distribution. The average minimum path length and its fluctuation can also be estimated 
by a Gaussian fit to the data, which presents (l) = 7.85 and 07 = 2.74. 

In most small- world networks, the average minimum path length is somewhat larger than that 
for a random graph. 7 It is interesting to compare the average minimum path length of the real 
ammonia plant network with that of model networks. The null model is the maximally random 
networks with the same number of nodes and the same degree sequence as the real network. There 
are several methods for the generation of random graphs with prescribed degree sequences and the 
chain switching method gives accurate results with acceptable computational time, 7 which was used 
in the detection of rich-club structure 7 and is the very null model in the statistical tests of network 
topological properties. 7 Adopting the chain switching method, we have generated 12400 random 
networks. The average minimum path length Z ran d of each model network is calculated. It is found 
that Z r and = 5.90 ± 0.07. What is striking is that the maximum of Z ran d is 6.15, much smaller than 
(I) = 7.76. 

The clustering coefficient C% of vertex i is a measure of the cluster structure indicating how much 
the adjacent vertices of the adjacent vertices of i are adjacent vertices of i. Mathematically, C, is 
defined by 

E- 

Cl = h{h -l)/2 ' (1) 

where E{ is the number of edges among the adjacent vertices of i? The average clustering coefficient 
C = (Cj) is 0.083, which is comparable to other technological networks. 7 Using the same database of 
the maximally random networks, we find that C ran d = 0.0075 ± 0.0036 and the maximum clustering 
coefficient of random networks is 0.025, much smaller than C = 0.083 for the ammonia plant network. 
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This is the evidence supporting that the AP network is a small- world network/ 



AP network is scale-free 

The degree k of a vertex of a network is the number of edges connected to that vertex. Degree 
distributions of vertices are perhaps the most frequently investigated in the literature of complex 
networks. ?,?,? The degree distributions of scale-free networks have fat tails following power laws 

p(k) ~ k-^ + V (2) 

Several mechanisms of scale free distributions have been proposed, such as preferential attachment 
and its variants 7 and fitness of vertices. 7,7 In order to estimate the probability distribution of a 
physical variable empirically, several approaches are available. For a possible power-law distribution 
with fat tails, cumulative distribution or log-binning technique are usually adopted. A similar con- 
cept to the complementary distribution, called rank-ordering statistics, 7 has the advantage of easy 
implementation, no information loss, and being less noisy. 

Consider N observations of variable k sampled from a distribution whose probability density is 
p(k). Then the complementary distribution is P(y > k) = p(y)dy. We sort the n observations in 
non- increasing order such that k± > ki > • • • > k n > ■ ■ ■ > k n , where n is the rank of the observation. 
It follows that NP{k > k n ) is the expected number of observations larger than or equal to k n , that is, 

NP(k >k n )=n . (3) 

If the probability density of variable k follows a power law that p(k) ~ 5 then the complementary 

distribution P(k) ~ k~^. An intuitive relation between k n and n follows 

k n ~ n" 1 ^ . (4) 

A rigorous expression of (4) by calculating the most probable value of k n from the probability that 
the n-th value equals to k n gives 7 



fin + 1 

When fin > 1 or equivalently 1 <C n < N, we retrieve (4). A plot of lnk n as a function of Inn gives 
a straight line with slope —1/fi with deviations for the first a few ranks if k is distributed according 
to a power law of exponent fi. We note that the rank-ordering statistics is nothing but a simple 
generalization of Zipf's law 7 ' 7 ' 7 and has wide applications, such as in linguistics, 7 the distribution of 
large earthquakes, 7 time-occurrences of extreme floods, 7 to list a few. More generally, rank-ordering 
statistics can be applied to probability distributions other than power laws, such as exponential or 
stretched exponential distributions, 7 normal or log-normal distributions, 7 and so on. 

In Fig. 2 is shown the rank-ordering analysis of the in-degree, out-degree and all-degree of the AP 
network in log-log plot. We see that the AP network is scale-free. Linear regression of lnfc n against 
Inn gives the following exponents: 1/fi = 0.419 ± 0.010 for all-degree, 1/fi = 0.407 ± 0.009 for in- 
degree, and 1/fi = 0.443 ± 0.008 for out-degree. Therefore, we have fi = 2.39 ± 0.06 for all-degree, 
fx = 2.46 ± 0.05 for in-degree, and fi = 2.31 ± 0.04 for out-degree. 
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Modular structure in the AP network 



Brief review 

In the recent years, much attention has been attracted to the modular clusters or community structures 
of real networks, such as metabolic networks, 7 ' ? food webs, ? ' ? social networks, 7,7,7 ' 7 to list a few. 

There are rigorous definitions for community. A strong community is defined as a subgraph of the 
network requiring more connections within each community than with the rest of the network, while 
in a weak community the total number of connections of within-community vertices is larger than the 
number of connections of the vertices in the community with the rest of the network. 7,7 However, 
in most cases in the literature, community is only fuzzily defined in the sense that the connections 
within communities are denser than between communities. 

Different types of algorithms have been developed for the detection of communities. 7 Sokal and 
Michener proposed the average-linkage method, 7 which was extended to the hierarchical clustering 
algorithm later. 7 In 1995, Frank developed a method for direct identification of non-overlapping 
subgroups, 7 which was applied to detect compartments in food webs. 7 Girvan and Newman proposed 
a divisive algorithm that uses edge betweenness centrality to identify the boundaries of communities, ' 
which is now widely known as GN algorithm. Based on the concept of network random walking, Zhou 
used dissimilarity index to delimit the boundaries of communities, which was reported to outperform 
the algorithm based on the concept of edge betweenness centrality. 7, 7 An alternative divisive algorithm 
of Radicchi et al. is based on the edge clustering coefficient, related to the number of cycles that include 
a certain edge. ' ' ' Another well-known algorithm is Newman's maximum modularity algorithm, which 
is a type of agglomerative algorithm. 7,7 

Many other algorithms have been presented, for instance, the Kernighan-Lin algorim, 7 the spectral 
method which takes into account weights and link orientations and its improvement, 7,7 the resistor 
network approach which concerns the voltage drops, 7 the information centrality algorithm that consists 
in finding and removing iteratively the edge with the highest information centrality, 7 a fast community 
detection algorithm based on a g-state Potts model, 7 an aggregation algorithm for finding communities 
of related genes, 7 the maximum modularity algorithm incorporated with simulated annealing, 7 the 
agent-based algorithm, 7 the shell algorithm, 7 and the algorithm based on random Ising model and 
maximum flow. " 

Community structure of the AP network 

We apply Newman's maximum modularity algorithm 7,7 to study the community structure of the AP 
network. The resultant AP tree is illustrated in Fig. 3, which is not in the form of dendrogram. 
The shapes of the vertices represent different sections of the process flow of the AP: SGP section-oil 
(solid circles), rectisol section-oil (horizontal ellipses), CO-shift section (vertical ellipses), synthesis 
& refrig. section (open circles), air separation section (triangles), nitrogen washing section (vertical 
diamonds), steam superheater unit (horizontal diamonds), ammonia storage & tank yard (rectangles), 
and equipments of waste treatment (squares). The maximum value of the modularity is Q = 0.794, 
which is among the largest peak modularity values reported for different networks (if not the largest) 
and thus indicates a very strong community structure in the investigated network. 

It has been found that random graphs and scale-free networks have modularity with analytic 
expressions, 7 which allows us to check if the modularity observed in the AP network is mathematically 
significant or not. Since the modularity of a scale- free network with S = 505 nodes and connectivity 
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m = 749/505 is 

<3sF = ( 1 ^)( a + i ^) =a6773 ' (6> 

which is again much smaller than Q = 0.794, showing that the modularity of the AP network is 
significant. Note the a = 0.165 ± 0.009. ? For an Erdds-Renyi random graph with S = 505 nodes and 
connection probability p = 749/(505 x 504/2) = 0.0059, the maximal modularity is 

<3ER= ( 1 -^)( 1 -^) 2/3 = a6995 ' (7) 

The fact that Q is greater than Qer indicates that the modular structure extracted from the AP 
network could not be attributed to the fluctuation of random graphs and is thus still very significant. 

Alternatively, we can use the same null model which employs the chain switching algorithm to 
generate maximally random networks with the same degree sequence of the AP network, we find that 
Qrand = 0.440 ± 0.009 and the maximum of the modularity of model networks is 0.469, which is much 
smaller than Q = 0.794. This test provide further evidence that the modular structure in the AP 
network is statistically significant. 

The modular structures of chemical plant networks do not come out as a surprise. In a chemical 
plant, raw materials are fed into the process flow network and react from one section to another 
successively, although there are feedbacks from later sections. In general, flows are denser within a 
workshop section than between sections. Therefore, a section is naturally a community. In Fig. 3, 
most of the vertices in a given section are recognized to be members of a same community. The 
vertices of the storage and tank yard (rectangles) are the most dispensed in Fig. 3. This is expected 
since these tanks are linked from and to different sections in the process, which shows the power of 
Newman's maximum modularity algorithm for community detection. 



Allometric scaling of the AP tree 

The network shown in Fig. 3 is actually a tree. Trees exhibit intriguing intrinsic properties other than 
non-tree networks, among which is the allometric scaling. Allometric scaling laws are ubiquitous in 
networking systems such as metabolism of organisms and ecosystems river networks, food webs, and 
so forth. •>•'•'•'•>•'• The original model of the allometric scaling on a spanning tree was developed 
by Banavar, Maritan, and Rinaldo. 7 The spanning tree has one root and many branches and leaves, 
and can be rated as directed from root to leaves. Mathematically, each node of a tree is assigned a 
number 1 and two values Ai and Si are defined for each node i in a recursive manner as follows: 

A i = ^2A j + l, (8a) 
j 

and 

s 1 = ^2 s j+ a ^ ( 8b ) 

where j stands for the nodes linked from i? In a food web, i is the prey and j's are its predators (thus 
the nutrition flows from i to j's). The allometric scaling relation is then highlighted by the power law 
relation between Si and A^. 

S-A 71 . (9) 
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For spanning trees extracted from transportation networks, the power law exponent 77 is a measure 
of transportation efficiency. 7 ' ? The smaller is the value of 77, the more efficient is the transportation. 
Any spanning tree can range in principle between two extremes, that is, the chain-like trees and the 
star-like trees. A chain tree has one root and one leaf with no branching. Let's label leaf vertex by 1, 
its father by 2, and so forth. The root is labelled by n for a chain-like tree of size n. The recursive 
relations (8) become Si = Si_i + A{ and Ai = A%—\ + 1 with termination conditions A\ = S\ = 1. It 
is easy to show that Ai = i and Si = i(i + l)/2. Asymptotically, the exponent 77 = 2~~ for chain-like 
trees. For star-like trees of size n, there are one root and n — 1 leaves directly connected to the root. 
We have ^4 = 5 = 1 for all the leaves and A = n and 5 = 2n — 1 for the root. It follows approximately 
that rj = 1 + . Therefore, 1 < rj < 2 for all spanning trees. 

We note that not all trees have such allometric scaling. Consider for instance the classic Cayley 
with n generations where the root is the first generation. The A and S values of the vertices of the 
same generation are identical. If we denote A4 and Si for the vertices of the (n + 1 — i)-th generation, 
the iterative equations are Ai+\ = 2Ai + 1 and S^+i = 2Si + Aj+i, resulting in A, = 2* — 1 and 
Si = (i— 1)2* + 1. This leads to S = [log 2 (^4 + 1) — l]yl + log 2 (j4 + 1). Obviously, there is no power-law 
dependence between A and S. 

We apply this framework on the AP tree. The calculated S is plotted in Fig. 4 as a function of A. 
A nice power-law relation is observed between S and A. A linear fit of lnS" against In A give rj = 1.21 
with regression coefficient 0.998. The trivial point (A = 1,5 = 1) is excluded from the fitting. 7 This 
value of 77 is slightly larger than 77 = 1.13 ~ 1.16 for food webs ? but much smaller than rj = 1.5 for 
river networks.' This analysis is relevant when the flux in the pipes and reactors are considered for 
the investigation of the transportation efficiency, as an analogue to the river network and biological 
network. ' > • 

Concluding remarks 

We have studied a complex technological network extracted from the process flow of the Ammonia 
Plant of Jiujiang Chemical Fertilizer Plant in Jiangxi Province of China. We have shown that the 
ammonia plant network is a small-world network in the sense that its minimum average path length 
(I) = 7.76 and global clustering coefficient C = 0.083 are respectively larger than their counterparts 
^rand = 5.90 ± 0.07 and C ra nd = 0.0075 ± 0.0036 of a semble of 12400 maximally random graphs 
having the same degree sequences of the real AP network. We found that the shortest path lengths 
between two arbitrary vertices are distributed according to a Gaussian formula. The distribution of 
degrees follows a power law with its exponent being ri = 2.31 — 2.46, indicating that the AP network 
is scale-free. 

We have reviewed briefly diverse existing algorithms for the detection of community structures 
in complex networks, among which Newman's maximum modularity algorithm is applied to the AP 
network. The extracted modular structures have a very high modularity value Q = 0.794 signaling the 
significance of the modules, which is confirmed by statistical tests. These modular structures are well 
explained by the workshop sections of the ammonia plant. We have constructed a spanning tree based 
on the community identification procedure and found that the resultant AP tree exhibits excellent 
allometric scaling with an exponent comparable to the universal scaling exponent of food webs. 

In summary, we have studied the topological properties of the AP network from chemical engi- 
neering. More sophisticated networks can be constructed from process flows in chemical industry. 
There are still other open problems even in this small AP network, such as the origin of the scale-free 
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feature, what we can learn from these topological features, robustness and sensitivity analysis on the 
mass flows to find out bottlenecks in the process or figure out how jamming of the nodes or cascade 
failure of the system can occur, to list but a few. We hope that this work will attract more affords 
in this direction. Further researches on complex networks containing information of transports and 
reactions will unveil useful properties and benefit the field practically and theoretically. 
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• Figure LAmmonia plant process flow diagram with the major equipment. 1 - gasification reactor, 
2 - carbon scrubber, 3 - H2S absorber, 4 - humidifier, 5 - shift converter, 6 - dehumidifier, 7 - 
CO2 absorber, 8 - air compressor, 9 - pressure column, 10 - nitrogen compressor, 11 - nitrogen 
wash column, 12 - synthesis gas compressor, 13 - ammonia converter, 14 - unitized chiller, 
15 - liquid ammonia tank. 

• Figure 2: Rank-ordering analysis of the in-degree, out-degree, and all-degree of the AP network. 
We have translated vertically the in-degree line by 4 and the out-degree line by 25 for better 
presentation. The lines are the best fit of tail distribution to (4). 

• Figure 3: (Color online) Modular structure of the AP network. The shapes of the vertices 
represent different sections of the process flow of the Ammonia Plant. This figure was produced 
with Pajek. 7 

• Figure 4: Power-law scaling of S against A. The line represents the power-law fit to the data. 
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Figure 1: Ammonia plant process flow diagram with the major equipment. 1 - gasification reactor, 
2 - carbon scrubber, 3 - H2S absorber, 4 - humidifier, 5 - shift converter, 6 - dehumidifier, 7 - 
C02absorber, 8 - air compressor, 9 - pressure column, 10 - nitrogen compressor, 11 - nitrogen wash 
column, 12 - synthesis gas compressor, 13 - ammonia converter, 14 - unitized chiller, 15 - liquid 
ammonia tank. 
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Figure 2: Rank-ordering analysis of the in-degree, out-degree, and all-degree of the AP network. We 
have translated vertically the in-degree line by 4 and the out-degree line by 25 for better presentation. 
The lines are the best fit of tail distribution to (4). 
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Figure 3: (Colour online) Modular structure of the AP network. The shapes of the vertices represent 
different sections of the process flow of the Ammonia Plant. This figure was produced with Pajek. ? 
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